A Wavelet Transform Based Protein Sequence Similarity Model

نویسندگان

  • Jie Su
  • Junpeng Bao
چکیده

Protein sequence analysis is an important tool for researchers to study on bio-informatics and molecular biology, such as proteins structure and function prediction, phylogenetic classification and different conservation pattern recognition. It is a significant open issue to quickly efficiently find the similar proteins from a large scale of protein repository. This paper proposes a new method based on Discrete Wavelet Transform (DWT) to measure the similarity of protein sequences, i.e. the ACDWT model, as well as two amino acid encoding methods (HPC and ADCC) according to hydropathy properties and dissociation constants respectively. The model employs only the approximation coefficients of DWT so that the feature vector is short. That brings the proposed model a great running time promotion. According to the phylogenic trees about nine ND5 proteins made from our model and others, the experimental results show that our model is efficient and a little better than the others.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting allergenic proteins using wavelet transform

MOTIVATION With many transgenic proteins introduced today, the ability to predict their potential allergenicity has become an important issue. Previous studies were based on either sequence similarity or the protein motifs identified from known allergen databases. The similarity-based approaches, although being able to produce high recalls, usually have low prediction precisions. Previous motif...

متن کامل

Protein sequence comparison based on the wavelet transform approach.

A protein's chemical properties, the chain conformation, the function of the protein and its species specificity are determined by the information contained in the amino acid sequence. Proteins of similar functions have at some level sequential identical amino acid sequences. The closer the phylogenetic relationship, the more similar are the sequences. To find the similarities between two or mo...

متن کامل

A combined Wavelet- Artificial Neural Network model and its application to the prediction of groundwater level fluctuations

Accurate groundwater level modeling and forecasting contribute to civil projects, land use, citys planning and water resources management. Combined Wavelet-Artificial Neural Network (WANN) model has been widely used in recent years to forecast hydrological and hydrogeological phenomena. This study investigates the sensitivity of the pre-processing to the wavelet type and decomposition level in ...

متن کامل

Analysis of Protein Sequences Using Time Frequency and Kolmogorov-Smirnov Methods

The plethora of genomic data currently available has resulted in a search for new algorithms and analysis techniques to interpret genomic data. In this two-fold study we explore techniques for locating critical amino acid residues in protein sequences and for estimating the similarity between proteins. We demonstrate the use of the Short-Time Fourier Transform and the Continuous Wavelet Transfo...

متن کامل

Comparative Analysis of Image Denoising Methods Based on Wavelet Transform and Threshold Functions

There are many unavoidable noise interferences in image acquisition and transmission. To make it better for subsequent processing, the noise in the image should be removed in advance. There are many kinds of image noises, mainly including salt and pepper noise and Gaussian noise. This paper focuses on the research of the Gaussian noise removal. It introduces many wavelet threshold denoising alg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013